Refining Image Categorization by Exploiting Web Images and General Corpus

نویسندگان

  • Yazhou Yao
  • Jian Zhang
  • Fumin Shen
  • Xian-Sheng Hua
  • Wankou Yang
  • Zhenmin Tang
چکیده

Studies show that refining real-world categories into semantic subcategories contributes to better image modeling and classification. Previous image sub-categorization work relying on labeled images and WordNet’s hierarchy is not only laborintensive, but also restricted to classify images into NOUN subcategories. To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantic rich (sub-)categories. The following two major challenges are well studied: 1) noise in the labels of subcategories derived from the general corpus; 2) noise in the labels of images retrieved from the web. Specifically, we first obtain the semantic refinement subcategories from the text perspective and remove the noise by the relevance-based approach. To suppress the search error induced noisy images, we then formulate image selection and classifier learning as a multi-class multi-instance learning problem and propose to solve the employed problem by the cutting-plane algorithm. The experiments show significant performance gains by using the generated data of our way on both image categorization and sub-categorization tasks. The proposed approach also consistently outperforms existing weakly supervised and web-supervised approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Privileged Information from Web Data for Image Categorization

Relevant and irrelevant web images collected by tag-based image retrieval have been employed as loosely labeled training data for learning SVM classifiers for image categorization by only using the visual features. In this work, we propose a new image categorization method by incorporating the textual features extracted from the surrounding textual descriptions (tags, captions, categories, etc....

متن کامل

مرور مؤثر نتایج جستجوی تصاویر با تلخیص بصری و متنوع از طریق خوشه‌بندی

With unprecedented growth in production of digital images and use of multimedia references, requirement of image and subject search has been increased. Systematic processing of this information is a basic prerequisite for effective analysis, organization and management of it. Likewise, large collections of images have been made available on the Web and many search engines have provided the poss...

متن کامل

Speckle Reduction in Synthetic Aperture Radar Images in Wavelet Domain Exploiting Intra-scale and Inter-scale Dependencies

Synthetic Aperture Radar (SAR) images are inherently affected by a multiplicative noise-like phenomenon called speckle, which is indeed the nature of all coherent systems. Speckle decreases the performance of almost all the information extraction methods such as classification, segmentation, and change detection, therefore speckle must be suppressed. Despeckling can be applied by the multilooki...

متن کامل

Search Based Face Annotation Using Weakly Labeled Facial Images

Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. Most of the images are obtained from World Wide Web (WWW), so there is possibility of getting noisy and incomplete images. This study tackle problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of facial images...

متن کامل

Image flip CAPTCHA

The massive and automated access to Web resources through robots has made it essential for Web service providers to make some conclusion about whether the "user" is a human or a robot. A Human Interaction Proof (HIP) like Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) offers a way to make such a distinction. CAPTCHA is a reverse Turing test used by Web serv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1703.05451  شماره 

صفحات  -

تاریخ انتشار 2017